


Institutional Archive of the Naval Postgraduate School 





Calhoun: The NPS Institutional Archive 
DSpace Repository 


Theses and Dissertations 1. Thesis and Dissertation Collection, all items 


1972 


Application of parametric time series analysis 
methods to descriptions of wave-induced 
fluctuations in the adjacent airflow. 


Smith, Craig Howard. 


Monterey, California. Naval Postgraduate School 


http://ndl.handle.net/10945/15998 


Downloaded from NPS Archive: Calhoun 


Calhoun is the Naval Postgraduate School's public access digital repository for 
| (8 D U DLEY research materials and institutional publications created by the NPS community. 
«ist sia Calhoun is named for Professor of Mathematics Guy K. Calhoun, NPS's first 


NY KNOX appointed — and published -- scholarly author. 

ia) LIBRARY Dudley Knox Library / Naval Postgraduate School 

411 Dyer Road / 1 University Circle 
Monterey, California USA 93943 





http://www.nps.edu/library 





bd 


APPLICATION OF PARAMETRIC TIME SERIES ANALYSIS 
METHODS TO DESCRIPTIONS OF WAVE- 
INDUCED FLUCTUATIONS IN THE ADJACENT AIRFLOW 


Craig Howard Smith 








DSTORADUATE SCHOOL 


onterey, Gailfornia 


catltiooees 
oe 
Donel 
oe 
| ew 
<= 


wioieecatonmmoterarame. ric Lime Series Analysis 
Methocs to Deseriptions of Wave- 
Induced Fluctuations in the Adjacent Airflow 


by 


Craig Howard Smith 


Mrs say LSsor : kK. la Dayidson 





September 1972 


Approved for public release; distribution unluncted. 





Application of Parametric Time Series Analysis 
Methods to Descriptions of Wave- 
Induced Fluctuations in the Adjacent Airflow 


by 


Craig Howard Smith 
Ensign, United States Navy 
AGbseeeranceton University, 19/1 


Submitted in partial fulfillment of the 
requirements for the degree of 


MASTER OF SCIENCE IN METEOROLOGY 


from the 


NAVAL POSTGRADUATE SCHOOL 
September 1972 





* 
Faas : a 





ABSTRACT 


Certain methods of analysis for testing the hypothesis 
that organized motion exists in the airflow above naturally 
occurring water waves are examined. In particular, two cur- 
rent methods, spectral analysis and joint probability density 
function analysis, are briefly discussed; two new methods, 
Matched filters and parametric time series analysis, are 
Suggested. 

An analysis is done with parametric time series analysis 
On wind-wave data. The results show the tractability of 
this data to parametric methods, and the results are in 
agreement with previous spectcal analyses of the data in the 


regions of overlap. 
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eee LT RODUCT LON 


In recent years, interest in parameterizing and defining 
processes in the air-sea boundary layer has intensified. Re- 
cent theoretical models [Miles, 1957] postulate, among other 
things, that the airflow near the surface and the waves are 
coupled in such a manner as to produce organized motion in 
the air in relation to the wave field. This results in air 
motion with the same period as the dominant period of the 
Waves. This paper examines certain methods of analysis by 
Which these hypotheses can be tested. [In particular, two 
current methods are discussed, two new methods suggested, and 


an analysis using one of the new methods is performed. 


Jen 





II. BACKGROUND 


Heponsalizedqsmotiton exists in the air due to the presence 
of the underlying waves, its existence and nature should be 
reflected in observations of the wind over the waves. Unfor- 
tunately for a prospective analyst of such observations, this 
regime is characterized by the presence of a general turbulent 
field which serves to mask the organized motion. The problen, 
then, is to discover whether organized motion does exist, and 
Mmemeedoes, LES nature, in the midst of random turbulent in- 
fluences. This Should be determined from data obtained over 
Natural waves. 

It is the usual case that data of interest take the form 
Smeescimultaneous continuous time traces of variables of inter- 
est, like wind components and wave heights. For purposes of 
ease of analysis, such records are usually subsequently dis- 
cretized so that the final form is that of a multivariate 
discrete time ee eis: This is the case with data used in 
this paper. That data was obtained from observations over 
natural waves on Lake Michigan; the data are fully described 
elsewhere [Davidson, 1970]. 

Time series differ from other types of statistical sample 
realizations in that the observations are assumed to be de- 
pendent upon one another, whereas in the larger body of sta- 
tistical technique, it is preferred that observations be 
tmcdeneomdenmtw ithe exploitation of this expected dependence 


is a distinguishing characteristic of time series analysis. 


WZ 





PV eromoiece sted) Dy Ehe form of the data, the majority of the 
techniques considered here are based on time series analysis. 
One method, that of joint probability distribution function 
analysis, is not drawn from the body of time series analysis, 
but it also uses the dependent nature of the data to advan- 


tage, though indirectly. 


Ewe OE ECTRAL ANALYSIS 

Most widely known of the time series analysis techniques 
is spectral analysis. This technique examines the data in 
the frequency domain. The most common representation of spec- 
tral results is the sample spectrum, whose magnitude for a 
given frequency reflects the amount of variance in the data 
which can be accounted for by the presence of a cosine compon- 
ieee the given frequency. It is, in fact, the Fourier 
Cosine transform of the estimated autocovariance function. 
Hence, the spectrum of data arising from a process dominated 
by a periodic effect should exhibit a marked peak at the fre- 
quency of the periodic effect, while the spectrum of white 
noise should be flat. Thus noise should raise the level of 
Pe sapectrum, but should not obscure significant peaks. Gen- 
ST ne icmaypobmesized relationship hold, one would ex- 
pect a peak in the wind component spectra at the same frequency 
as the major peak in the wave spectrun. 

Spectral analysis also has a natural extension into multi- 
Po eecmo Dace mcatled Cross-spectral analysis. As the co- 
variance of wind and waves is of primary interest here, 


calculation of cross-spectra should reveal some information 
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of interest, including estimates of phase relationships, co- 
Merence, and Cross-covyariance with respect to frequency. Pre- 
d@ietions from theoretical models for these yalues may then be 
compared to the estimates resulting from actual observations 
Temaneindication of the validity of the theory. 

Spectral analysis is not without major drawbacks, however. 
A major handicap is that it can only deal adequately with 
Seat ionlary time series. Often natural time séries are not 
observed to be stationary, and may require elaborate algo- 
rithms to be applied in an effort to remove trends and/or 
seasonality to obtain stationarity. A second difficulty is 
the lack of smoothness in actual sample en ee A typical 
sample spectrum is so jagged that interpretation is difficult. 
The usual remedy is to smooth the data, smooth the spectrun, 
Or both. This makes the information more readily apprehended, 
but the complex effect of such smoothing on individual esti- 
Hateseot spectral Ordinates makes the statistical significance 
of the estimates difficult or impossible to determine. It 
iaveraeso result in a loss of information contained in the 
data. Further, while confidence intervals for unsmoothed 
estimates of spectral ordinates may be calculated theoretical- 
ly, theory also shows that the error in the estimate of the 
spectral ordinate will be of the same order of magnitude as 
the spectral ordinate itself [Kendall and Stuart, 1966]. 
These facts make it difficult to quantitatively support the 
belief that a particular peak in a smoothed spectrum is 


See tiecaniteeeticmnendall! and Stuart [1966] rightly caution 
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meteorologists and oceanographers by name about assuming the 
Peeiotemece Of periodic elements in the data generating process 
Solely on the basis of spectral analysis; care is necessary. 
Further discussions of other possible pitfalls as well as much 
more complete discussions of spectral analysis are to be found 
in Blackman and Tukey [1958], Kendall and Stuart [1966], and 
Jenkins and Watts [1968]. 

Although unsatisfying in some statistical senses, spectral 
analysis is a useful tool to be applied to looking for wind- 
wave coupling. The data used in this paper was extensively 


analyzed spectrally by Davidson [1970], with good results. 


B. MATCHED AND WIENER FILTERS 

Another type of analysis related to spectral analysis 
Which may hold promise for the wind-wave coupling problem is 
to be found in the repertoire of electrical engineering. 
Electrical engineers are frequently faced with the problem 
Gameaetecting and isolating a signal in the wide at noise, 
Pececrtoely analopous to Looking for organized motion in the 
air in the midst of turbulence. To solve this problem, two 
types of spectrally derived filters have been developed, de- 
Signed to give a large response to signal and a small res- 
POemeeoumtOrce . slney are matched filters and Wiener filters. 
Pomme mveste Knowledge of the author, no one has yet applied 
such filters to wind-wave data looking for organized motion, 
but Sokol [1971] applied this technique to detection of ther- 
Mal plumes in the air adjacent to the sea from temperature 


traces of a type similar to the wind-wave data. His results 
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Weine Quite encouraging for the hypothesis that such an analy- 
Beommteie De useful in Ehe detection of organized metion in 
turbulence. 

At present, the major problem with the application of 
these techniques to Wind-wave data appears to be sensitivity 
to small errors in estimates of certain parameters which are 
difficult to estimate accurately, like the phase of the per- 
ijodic effect. However, the magnitude of such problems is as 
yet unknown, and does not preclude at least a comprehensive 
preliminary examination of these techniques for practicality 


with respect to the wind-wave problen. 


ee OLNT PROBABELITY DENSITY FUNCTION ANALYSIS 

The one analysis technique currently used in this field 
which is not properly a member of time series analysis is 
Homme probability density function analysis. Basically, this 
Pes mines lee daea inthe probability domain. In the 
analysis, the probability density function of Sreeree of 
joint occurrences of two variables is calculated. Equal 
joint probability density contours are plotted on a two di- 
mensional array. The contours are then subjectively analyzed 
for deviations from the concentric circle pattern of a theo- 
Pic radon, Independent joint probability density func- 
tion, the current reference function being the bivariate 
CetmdcamiewOormalejoint probability density function: The 
pattern of these deviations, which is assumed to arise be- 


cause of the dependent nature of the time series, is then 
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evaluated in terms of the generating process and compared to 
Wideets expected by theory. This type of analysis has been 
applied to meteorological data by Holland [1972], Dayidson 
amderrank [1972], and Frank [1971]. 

This method shows great promise; however, it lacks refine- 
ment as yet. The choice of the bivariate standard normal 
Beaman noObabDitity density Tunction as a reference distribution 
is formally incorrect, although it may be satisfactory in 
cases involving many degrees of freedom. The proper choice 
momadnbivariatLe distribution analogous to the Student-t distri- 
Muetomein the univariate case. Using such a distribution, 
described in some detail in Birnbaum [1962], it appears quite 
feasible to develop an objective test of the significance of 
deviations from the distribution expected if the generating 
processes were random and independent. Further tests against 
Wem a Leterence distribution reduce, effectively, to t-tests 
for correlation coefficients and regression coefficients cal- 
culated between the two variables [Birnbaum, 1962, pages 223- 
242]. It would appear beneficial to include at least these 


last tests as a normal part of the analysis. 


Dee AMEE RIC TIME SERIES ANALYSIS 

The final method is that adyanced by Box and Jenkins 
[1970], called by some, parametric time series analysis. 
pomuetnod tivolyes fitting a stochastic model to the time 
serles, while allowing, in keeping with modern statistical 
thought, the data themselves to shape the model and the analy- 


Sis itself to a relatively great extent. This method is 
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relatively new; its capabilities and limitations are not con- 
mon knowledge in the manner that those of spectral analysis 
Pouce tn order to better understand these capabilities and 
Pemetations, particularly as they apply to the boundary layer 
data in this study, it is necessary to discuss the form of 
the models to be fitted. This is done in Sections 1 through 
3, which describe, though with considerably less detail, the 
ideas expressed by the primary SOMRee. Box and Jenkins’ text 
meg 70). alee th is background already given, the particular 
applications to wind-wave coupling investigations are dis- 
cussed in Section E. 
ieee Ss Proposition | 

Probably no process generating experimental observa- 
BeOnce ts Completely deterministic. Thus, deterministic 
models are not usually adequate in analyses, although physi- 
cists and others have successfully used them in analyses of 
data generated by an apparatus designed to be so accurate 
that noise contributions to the observations are negligible 
in relation to results of interest. But if the noise is of 
a Significant magnitude, as with turbulence data, models in- 
cme MOLSe must be used. If the probability structure of 
the process is of interest, as in the case of atmospheric 
turbulence studies, the normal course is to use a stochastic 
model. The model fitted by parametric time series analysis 
Homes rochastic. 

Central to parametric time series analysis is Yule's 


proposition [Yule, 1927] that a time series whose elements 


iss 





are highly dependent, as is the case with wind-wave records, 
Piweiccttiry spe Viewed as having axisen from the application 
Seed binear £Llter to a parallel series of independent random 
shocks drawn from a fixed distribution. This yiewpoint is 
purery heuristic, and need not correspond to the actual pro- 
cess at all to be useful. The proposition is symbolized in 
Figure la. 

PMimappuyine Lhis idea, the fixed distribution of the 
series of Sinn shocks will be taken to be normal sen mean 
zero and variance of - With the random shock series repre- 
sented by a's, the observed time series by ZS; and the 
linear filter elements by W'S, the relationship can be 


symbolized as 


eee ey Vote > 


liek (1 + v,8 + Bo pa enie J 0 a 


C 
=u + WCB) a, CZ 
where B is the standard backward shift operator, and 
Bex. = x (an 


E t=s 


If the sequence W(B) is finite, or infinite and convergent 
for B given a value of unity, the filter is said to be con- 
vergent. Memmrecortbter ms convergent, then z(t) is a station- 
ary time series, and is the mean of the series. If the 
Peers Unstable, 2 is an arbitrary reference point for the 
process level, and z(t) is non-stationary. 
ous The ARIMA Model 
Parametric time series analysis is based on fitting 


a particularly useful subset of linear filters of the kind 


Wg) 





Gee vtle  s Mode 1 





b. ARIMA Model 


ARIMA w(B) 





0(B) p yi 


MA Filter gy ae pers "Difference" 
Pik ter 






Cc. Transfer Function Model 


a! e 
UCED ia 7 C0 CB) 


bimear Filter 





Figure 1. Filter Models 
- after Box & Jenkins [1970] 
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Mmce described. This class of models is known as the autore- 
gressive integrated moving ayerage (ARIMA) class. As the name 
suggests, this model incorporates three separate features, 
the autoregressive terms, moving ayerage terms, and integra- 
meen operator. The model is easier to understand if one dis- 
cusses each part separately, Ditdtiemene total SEructure as 
new features are discussed. 

The autoregressive model associates the current value 
of the Bree asia Eli a linear combination of past values of 
the process and a current random shock. An autoregressive 


process of order p, AR(p), would be: 


Land ~e 


26 > 4 Ze 4 + . Zot. et d a (2.3) 


Z + 
p t-p C 
where Ze =z, - wu. If one considers the past values of ae aS 


the "independent" variables, then one has the standard linear 
regression relation, hence the name autoregressive. The p+t2 
parameters: eee oe ae » must generally be estimated 
Giececkiy from the data. 

Biaeomcleamr that by Substituting back within the 
model (Equation 2.3) for the past values used by the model, 


See ar 


Ze Oh Eg ao eo eae W) + a 


= (2.4) 


aoe 
one can eventually produce a representation of the AR(p) pro- 
e€Ss as an infinite series in the a's, and hence, as a 

Pimeareeritter of the type described previously. Writing the 


Ar(p) process as 


C25) 


BDz. sae 


ZL. 





and 


y(B)a, = 2, Ch) 
then one obtains 
CB) = w(B) Cap 


Wech Ehe same constraints for stationarity of wW(B) as noted 
previously. 

The second important process is the moving average 
process of order q, MA(q) which relates the current value of 
z. Bompast values of the a's. The MA(q) process may be re- 
presented as 
oe a2 o (Zap 


Writing the MA(q) process as: 


Zz. 5 og Se a> C229) 
it is immediately seen that this process already has the 
form of a linear filter where 

WCB) = 5 22 C25. 40) 


In a formal statistical sense, the form described above is 
not confined to moving averages of the a's, since the 6, 's 
need not be positive, nor sum to one, but this is standard 
nomenclature nonetheless. The qt2 parameters of the model: 


O4> Go reees ren lls ae again generally depend directly on 


a 
bitema@daia for their estimation. 

iWese two processes may be mixed to achieve parsimony 
of the model (discussed in a later section). The result is 


an autoregressive moving average process of order (p,q) that 


PonmesriA. (p,q): 


IL 





Or 


Po ) ZF a3 aie C2512) 


Hemea total of ptq+t2Z parameters: Py reees Oo: Op ocees Giants, 
o.. Meare required, and in general must all be estimated 
mimectiy from the data. If q is finite, Bg O22 is a stable 
mierter. For the equivalent filter of the ARMA (p,q) process 
momme stable, then, it is sufficient to require 6, B) to 
be stable, as the ARMA (p,q) equivalent filter is a simple 


iMmrinmite series: 


, eee 
WCB) = Oe (B) J, MBy (2.13) 


Famehem. tt is clear that the ARMA (0,q) model is equivalent 
to the MA(q), and the ARMA (p,o) to the AR(p) when all are 
applied to the same 2 and a's. 

For reasons of simplicity and stability of calicula- 
Bites useful to require that the only ARMA(p,q) proces- 
ses considered be those which are stable, and hence would 
result in a stationary z(t) if applied to a sequence of ran- 
dom shocks, a(t). But many observed time series are inade- 
quately fitted by such an ARMA (p,q) model because they are 
Maat comity sim mature. Many of these may be adequately 
fitted by using a generalized autoregressive operator P (B) 
iiepiwace of vse The result will be an autoregressive in- 
tegrated moving average model. This is shown schematically 


in Figure lb. 
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APC cmoMowm striate the roots of the equation 


Q 


Oe? 
2 Pp | 
La 4 [Bais %> Re ae ve B C2214) 


H 


where B is now considered in the same fashion as a variable 
Polynomial, mist bie outside the unit circle for b, (B) 
Mempe Stable. Lt can further be shown that if any of the 
Beots iiées inside the unit circle for either 


0 


8) 2.15) 


0 


it 


Te C2276.) 
then the ARMA(p,q) model exhibits explosive growth, thus vio- 
lent non-stationarity. However, it turns out to be very use- 
Pitt reating many types of nen-stationarity to allow one 
Or more of the roots of 
P(B) = 0 C27) 
Mmeomedudi to unity. Letting the V operator symbolize this: 
Ve (1 - B) (2.18) 


one can then write Y(B) as: 


P (B) 


d 


oy by V 


(BY (1-B)° (2.19) 


where d is the number of roots of 9Y(B) set equal to unity. 
Thus, the autoregressiye integrated moving average model may 


be written: 


MN 


a8) Ze 0B a (2520) 


t 
or 


MN 


6 (B) Vo z 


Pee o AS 
5 F Bg Pe a ( ) 


xe 
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ipsemodel can be symbolized by ARIMA (p,d,q). Substituting 

Wee a Z (2.22) 

produces a process in the wens which can be represented by a 
stable equivalent filter, since the process may be written 

ey MN = ee “2.,23> 


'"s. This re- 


which is a simple ARMA (p,q) process in the Wes 
veals the types of non-sStationarity which can be handled. In 
Beamer for Equation (2.23) to be true, op) must be a station- 
ary series. ee any series z(t) which can be reduced to 
Seaerotarity by forward differencing by the vd Operator ean 
be adequately fit by such a model... This includes any series 
whose "trend" can be adequately approximated by a polynomial 


of degree d or less. 


Wemnmete thac the term integrated arises because 


af C 
V Z = SZ. = each (2.24) 
and, for example 
3 C LS t 
Sir =e) u ee C2223) 
[c - ° h ® 
1=-©& D)) eee Se h=-© 


Pius in forecasting a future value of Zi.» Say z one uses 


Ete’ 
the summation operator to go from the predicted and derived 


? ° 
w. 8 EOmEne predicted Zz 4] 


S. 

There is still one important characteristic that a 
Pimicerceribecemay stave that 1s useful to be able to handle: 
seasonality. Many treatments of seasonality are possible 


Within the framework exhibited so far. This discussion will 


be limited to the multiplicative technique described below. 
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Box and Jenkins [1970] should be referenced for more detailed 
fronmablLon On both this and other techniques. 

The multiplicative technique is a simple extension of 
the ARIMA (p,d,q) model. The basic assumption is that the 
variation between observations separated by an increment of 
time equal to the period of the seasonal component can be 
adequately fitted by the same sort of models as used on adja- 
cent observations. Hence, replacing B by B° in the ARIMA 


(p,d,q) model, we get a seasonal representation: 


S D~ _ S 
O, (B ) a Zoe CG, (3 ) a. @ 2.26) 
alp 
s-l s°P 
nes = Co,8 sae +O,8 ) ve 
2 ssl s°Q 
(O,8 +...+ Gp B ) Ae eV C7) 
or We = i es ce POW peg eas. 
~@2 _ a 
(eee OW oes (2.28) 
where 
2 ee ee ae 
Wir = We z= (1 B’ ) Ze 


Diwteehieis also clear that the process will also de- 
pend on adjacent values, so the complete multiplicative model, 
Gicm uw MAGp.d,q) x (P,D,Q) is: 


Seo ets 
b, (BO, GV, 2, = 9,8) @ 


S 
é (CB ye (2329) 


Q 


This result is a very powerful class of models for fitting 
naturally occurring time series. This is the class upon 


Which parametric time series analysis is based. 
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Ss. ltranster Wunetion Models 
Bowetnomienminaestl970) further discuss transfer func- 


tion models designed to fit cases in which two processes are 


related by 

(1HE, Dt...+2,D") y(t) = (H,+H, Dt...+ H_D°)X(t-t) (2.30) 
il R oO} I S 

where D = d/dt, the ='s and H's are unknown parameters, and 


T is the dead-time or delay in the system. Where Y, and Xe 


are discrete time series measured at equispaced times, (2.30) 


becomes 
Yr 7 Ss 
(1+E,V4...48 V9 i = (n tn, Vt..-tn ae G2e. 3.10) 
Where D is replaced by V = 1-B as before. Substituting 
B = 1-V, we obtain 
Yr Ss 
(1-6, B-...-6 B )¥, = (09 w,B-...-w B eae. 
or 
_ b 
6(B)Y, = w(B)B xX. 
= 2(B)X a IR 


Pomiuenne ARIMA (p,d,q) case, this relation may be written in 
EiteetOnrme of a linear filter: 
% = 6~*(B)Q(B)X, > UE A (nce) 

which is stable if the v(B) series converge for B given a 
numerical value such that 0 < | B | ee cs bOmDe noted Lhat 
the transfer function model and the stochastic ARIMA models 
are somewhat parallel. 

Now, remembering the discussion on the lack of deter- 


Ministic time series, it is necessary to consider a stochastic 


transfer function: model by adding superimposed noise, Nos 


ALY 





Bowume process, where NG and x. are independent. Hence, we 
obtain 


Ln = vy (B)xX, + Ny (2.34) 


for the stochastic model. But there is no reason to suppose 
that the process does not effect the noise in some way, or 
that the noise contribution is unstructured. Therefore, the 
idea is advanced that the noise, Noe will be represented as 
having the form 


N= y(B)a.= @-* (B)e(B)a, (535) 


Mae Eemenemynsn Of semand O's are as before. This form includes 
unstructured noise, the raw random shocks, a's, as a single 
Subset of the possible structures. Hence, the model one would 
be led to fit is 


as v(B)X +0 (B)a, 


: 57" (B)Q(B)X, + P~* (B)O(B)a, (2.36) 


This is shown schematically in Figure lc. 


E. APPLICATIONS TO WIND-WAVE COUPLING 

For an application of these techniques to the problem of 
teeect Monro tmniypothesized organized motion in air adjacent to 
Natural waves, it seems clear that a transfer function stoch- 
anemic homitKe ly oto giye the most direct information as 
EO) Ene Structure of the relationship between the waves and 
Pree adgacent adirfilow, ©One would fit the transfer function 
model by regarding the wave heights as inputs, Xo and. a wind 
VerocrtEyecomponent as the output, Yi The existence of signi- 


ieatitemalues for the vy s would immediately suggest that there 
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Pees mick tat lomo 11 the overlying airflow which are not random 
Mmmeormattomeeo the waves, particularly if such yalues appear 
mepeatedly £or data from different obseryvatsonal periods» It 
omar OLtumate Cclreumstance that statistical tests of signifi- 
eemee for these parameters exist. However, the accuracy of 
these tests depends on how closely the parent distribution of 
fmematectEuatily calculated random shocks corresponds to the dis- 
tribution picked to be the theoretical parent of the random 
shocks in the model. The Structure of the fitted model may 
give clues as to the nature of the actual process. It is an 
unfortunate circumstance, however, that the transfer function 
models are beyond the scope of this paper. 

The analyses performed were involved fitting ARIMA (p,d,q) 
Moen Oe) somodels Lo individual time series. Three benefits 
may be possible from this procedure. First, models fitted 
individually to wind components and wave heights from the same 
observations may be compared for suggestions of interaction. 
This type comparison is difficult, however, because many indi- 
vidual models with differing orders in the various types of 
processes may be quite similar in result and fit, depending 
Cie nieCmadea. ss hence, for example, oné may not be able to say 
that an ARIMA (1,d,0) x (0,0,0) represents data with a differ- 
Jeo eu ChE LCm@EnanmeEnalbwgr: a Series best fit by an ARIMA 
Gime 2) sol OnnO)y the Patter arises because these two models 
CoOSe bye cOLrespond for certain eee Piya luesmof the para- 


meters involved. 
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poewaerrbpenerit is that the fitted model may suggest in 
Mecsas pOsotble structure for the process. Also possible 
amaemonre likely is the fact that repeated analyses of this 
Mmandemay help a researcher following an adaptive model build- 
ing strategy to infer a general model for given conditions. 
Peometnisc, he may inter some of the structure of the parent 
process. The researcher is helped in this quest by the sta- 
meotrecal confidence intervals it is possible to obtain for 
parameter estimates and joint confidence intervals for groups 
of parameters (with the same caution here as presented in the 
transfer function model discussion directly above). 

Third, and seemingly most important, this study may give 
an idea of the usefulness of these techniques in fitting to 
boundary layer data. The study intended to give an indication 
whether such data is of the types that this sort of analysis 
can handle, for pre-evaluation of the usefulness of further 
Studies along these lines. 

Moeilaatimenotcuah interest is that Yule's proposition [Yule, 
9271, discussed in I-D, may have a physical significance for 
Wind-wave data. It may be possible to oe the generalized 
turbulence of the airflow adjacent to natural waves as a 
parent random process producing a series of a 'Ss and the 
mov cComacmactingelike a2 linear £ilter on these a's EOmprcauce 
Papa ecemotlomeamd a resultant structuring of the noise 
elements, that is, the obseryed erence: hegent ppapexs) by 
Reynolds [1968] and Davidson and Frank [1972] suggest that 


PNeCmsotruMetire of the turbulent field is PMporkant in many 
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Peers Or alnt-sea interaction. Hence, parametric time serles 
ei cloemayeMloldmpredgt potential in boundary layer studies. 
However, tO reiterate a preyious statement, it ig not neces- 
Meyechat there Exist any correspondence at all between the 
Zee cenerating process and the Yule viewpoint for this type 


Mieanalysis to be beneficial. 


Sal 





ote olo GCONSTDERATIONS 


The data considered were from an August 19 observation 
period, described in Davidson [1970]. From these data three 
time series were chosen: wave height, u-component of the 
wind 1.5 meters above surface, and w-component of the wind at 
1.5 meters. ARIMA models were fitted to each of these series, 
and examined for goodness of fit. The models were examined 
mMOoneLnitOrmation Contained in them about the structure of the 
time series to which they were fit. 

The primary feature of parametric time series analysis 
is model fitting. An iterative approach to model building 
was used which allows the data to influence the model struc- 
mime lhe Steps in this procedure are discussed in Section A. 
Again, a more ee discussion of this procedure is found 


mmbox sand Jenkins text [Box and Jenkins, 1970, p. 18]. 


A. MODEL BUILDING 

In fitting ARIMA models, an iterative model building pro- 
eedtine 1S Most fruitful. The steps of such a procedure are 
described below and the process is summarized in Figure 2. 

Pee select tomeor Ceneral Class of Model 

Premwrtactestepean model building is to postulate a 

Poem ieclaccmor models sto be fitted, By choosing to perform 
Daramettte Cime Series analysis, ne researcher has implicit- 
ly chosen to use the ARIMA class of models. However, further 


choices are possible at this stage. Due to the very nature 


a2 





Selection of General 


Class of Models 


identiLiicarion 
OL 
¢ Model 


EBStimaeLon 
Of 
Parameters 





End 
Process 


Test 


Goodness of Fit 





Good Fit 


Bad Fit 
Accept Model 


Pitre Model) Budlding 
- after Box & Jenkins [1970] 
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Sete problem, One 1S yirtually forced to consider the 
seasonal extension to the ARIMA (p,d,q) model. 
Paoeeeaentiricakiton of Model 

MicmocermGmea tap imeEne procedure 1S the Edentification 
of a particular model or subset of models from the general 
mass chosen. This process can be influenced both by outside 
Knowledge and information contained in the data. 

Have osen to COlsider an ARIMA Co dsp x €P.),0) 
model, theory suggests that the order parameter d should be 
zero, and D should not. The need for d#0 would imply a notice- 
able change in the mean sea level was taking place during the 
period of observation. The need for D=0 would imply that no 
seasonal change in the time series at all was taking place 
ftir the period of observation. Seasonal variation certain- 
Myedoes occur in the waves, ane we expect seasonal variation 
io the air. 

It can be shown that the stable non-stationary opera- 


pee eee a) ey, ...s—1) The 


S 
EOr ee (1-B ) has zeroes at 
application of this operator to a time series will remove a 


seasonal component of the form 


[s/2] 
seasonal component = bet 2 


{b, cos (=24) +p Sin ey} 
. 5 Ss 27 Ss 
faa ee 
Witememtinenb S are adaptive coeificients implicit in the data, 
Smee G/2 1 GS 2s eyen, or [$/2}] = 1/2 (S=-1) if S is 
odd. This type of operator is a parsimonious representation 


of seasonal components of a type which require many sine and 


cosine waves to represent them adequately. An example of 
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such a seasonal component would be a spike at periodic in- 
bervals. 

tmeervapreoatets thar Eke main seasonal effeet will 
have the form of a single sine wave at the frequency of the 
Meter waves. The ve Sucradtean ts NOL 2 ParSimoni1ous repre— 
sentation of a single sine wave. Furthermore, the adaptive 
coefficients of at least some of the additional sine and 
cosine waves contained in the ve operator are likely to be 
inflated by intermittent and statistically non-significant 
periodicities in the random turbulent component of the motion. 
Therefore, an operator representing a single sine wave at the 
desired frequency, adaptive in phase and amplitude to the data, 
Was applied in place of the ve operator in some models. This 
eperatoreis [l=—2 cos (21/p)B + Ba ae where p is the desired 
period. This stable non-stationary operator, which has zeroes 


ati2t/p 


at » will remove a seasonal component of the type: 


seasonal component b sin(=")+ b 


2 
1 cos = ) 


2 


ee 
db, LNs teOn) 


where the b's are adaptive coefficients, 6 is an adaptive 
phase angle, and p is the desired period. 

Lous ruOnyets sa primary consideration at this stage. 
Peomeoimony 1nvolves Making use of the most efficient type of 
model possible. For example, while an AR(1) process can be 
theoretically represented by an infinite MA process, and by a 
Pieces quem laes process in practice, it is most parsi- 


moniously represented by the AR(1) model involving only one 
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parameter as opposed to an MA(q) process involving q > 1 para- 
meters. Similarly, an AR(p) model is not a parsimonious re- 
presentation of an MA(1) process, nor is either an AR or MA 
Medel a2 parsimonious representation of a mixed ARMA process. 

HeOMCeyeeuGwaG necessary to go to the data for an indi- 
mammoweor the values of p,q,P, and Q which are best fit, 
memembering that in practice a value of two or less for each 
@S mTOormally sufficient. To get this indication, one examines 
the autocorrelation function and partial autocorrelation 
mam@etion. A brief discussion of what one looks for is to be 
found in Appendix A. Estimates of p,q,P and Q are made, and 
Mmm estimates of the fitted parameters, the o's, O's, 6's, 
and @'s, are made. 

Se estimation 

tiomenend Stace. of the procedure is to fit the model 
to the data. The rough estimates of the fitted parameters are 
now used as initial guesses for iterative estimation routines. 
At this stage, the best fit for the model chosen is obtained. 

4. Diagnostic Checks 

The final stage is diagnostic sheekilae to determine if 
Bier Gedemodelsis adequate. If it is, the procedure ends, 
and the model is chosen. If the model is determined to be in- 
areqduatemotmeuisdtistactory, the process is iterated from the 
ZeeoudmoLepeulett a suitable model is either found or deter- 
mined to be non-existent. 

A more complete discussion of the diagnostic tools 


Mse@dena sO be found in Appendix B. 
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Bee 2 REL LMENARY ANALYSES 

Sicmpacmeminanyeatalysis produced estimated autocorrela- 
MEO uTetLoOm values, GStimated autocovarliance function values, 
ealteeactimated partial autocorrelation function yalues of the 
mEmer series which resulted from the application of the voy 
Mperatonr, with various values of d and D, and appropriate 
Mme s Or S$, £0 the initial data. These estimated function 
values were also calculated for the time series resulting when 
ve was replaced by [1-2 cos (271/p)B + ao the less general 
Operator discussed in A-2. From these estimated function 
values, the proper types and degrees of differencing needed 
Momoroduce a Stakionary time series were estimated. Further 
analysis of the estimated function values for the properly 
differenced series resulted in estimates for the proper values 
of the parameters p,q,P, and Q, for the multiplicative ARIMA 
womded) exe Cr. DlO} model to be fit. The set of the most likely 


model candidates was the result. 


C. ANALYSES 

The preliminary analysis yielded the set of the most like- 
ly model candidates; the parameters required by these models, 
Bhe Gs, © S, Gtc., were er eae by mapping the maximum 
[Piromenoctmoirrace Of the parameters, as reflected by the sum 
of the squares of the estimated a's (that is, the sum of the 
Squares of the residuals). Thus, .these estimates were "least 
Squares" estimates, and this method is an accepted routine 


for non-linear least squares estimation. Diagnostic checks 
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were then applied to the a's resulting from the use of the 
best estimates Lox the parameters in each model. These diag- 
Mestihe checks included comparing the magnitudes of the sums 
of the squares of the residuals of different models, calculat- 
ing the autocorrelations of the residual series, and calculat~- 
ing a chi-square test for adequacy of fit [Box and Jenkins, 
1970, p. 291]. These diagnostic checks were then compared in 
an effort to determine the best model of those examined. 

Unfortunately, the author was unable to get a Marquardt 
maximum descent nonlinear least squares algorithm to work. 
MeaGeethas been done, reasonably accurate confidence intervals 
and joint aeons eee for parameter estimates would 
have been easily available in addition to the speedier arri- 
val at the least squares estimates. As this was not done, 
alternate methods for obtaining these estimates, set forth 
mimnoxeand Jenkins {1970, p. 228], were explored. Consider- 
ing the lack of real need for such interval estimates in this 
study, due to the small number of time series analyzed and 
subsequent preclusion of a meaningful attempt at an adaptive 
model building strategy, these methods aes found to be too 
inefficient and time consuming to be of sufficient relative 
Value to attempt. Hence, although no confidence intervals 
Mememect inated, stheir exclusion was not significant in this 
study. 

In summary, the estimated autocorrelation functions, 
estimated autocovariance functions, and estimated partial 


autocorrelation functions of the time series resulting from 
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mpeeeatton on Ene difference operators to the original time 
series were used to identify the proper degree of differenc- 
Pieeciamelewmaat likely types of models to be fit to the pro- 
perly differenced series. The parameters of these models were 
estimated by a nonlinear least Squares mapping technique. The 
residuals, the ans resulting from each type of model when the 
least squares estimates for the model's parameters were used, 
were tested for indications of the model's goodness of fit. 
The results of these tests were compared, and the best model 


was chosen. 
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LV. RESULTS 


HiemeestteswOmerie parametric time series analyses were 
in good agreement with the spectral analyses done with the 
Same data by Davidson [1970]. The results clearly indicate 
Pacwetractabidaity of this type of data to this type of analy- 


Sis, 


Meee hRLODIC EFFECTS 

The vove difference operator was applied to the data for 
several values of d and D. Table I contains the values ap- 
Mered. From both spectral analysis and autocorrelation func- 
tion estimates, it was determined that the dominant period of 
the waves was best estimated as six seconds, corresponding to 
a period of 30 lags in the time series, while the air was 
Siserved to have a dominant period of only 4.7 seconds, cor- 
responding to 24 lags in the time series, in both the hori- 
zontal and vertical components. The estimated autocorrelation 
Functions of all three time series before differencing, Figure 
3, shows the form of a damped sine wave, but the damping 1s 
not great. The autocorrelations at high lag numbers do not 
Pemormesclinetctently fast to Support an assumption that 
tiese autocorrelation functions have arisen from stationary 
Pie mochheowe Luncanebe seen in Figure 4 that application of 
Elie ie MMewieonndadenot result in a picture more compatible 


with the assumption that stationarity resulted. None of the 


differencings involving the vove @eperator were satisfactory 
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TABLE I 
DEGREES OF DIFFERENCING EXAMINED 
owes 
s 


: or [l-2cos(2T/p) Bt Bz, 


p=s=24 for Airflow; p=s=30 for Wave heights 


ur 110 . _d_ op. Se (LPS Can) eae 
ul 0 0 No No 
i 1 0 No No 
3 2 0 No No 
4 0 Ht 24/30 No 
> 0 2 24/30 No 
6 i i 24/30 No 
7 iL Z 2) O10 No 
8 Z 1 PE BS) No 
7) 2 2 24/30 No 
10 0 0 No Yes 
ILA a 0 No Yes 
2 Zz 0 No | Yes 
is 0 1 4 Yes 
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when D # 0. Applying the vo Aibomesuwhth d = 2, resulted in a 
Haetcatactouy pattern £6r this data (Figure 5). However, the 
difference between the ve operator and the non-general opera- 
more fl-2 cos (27/p)B + a] discussed in III-A-2, is smail 

for p=24 for the wind, and p=30 for the waves. The [1-2 cos 
(27/p)+ Bo] operator gives a slightly better result (Figure 6) 
midewltim the support from theory, the assumption that this 
Operator is the proper one seems justified. No value d # 0 
gave a better result than d=0 when the full To Olas Cavee 

= BS | operator was applied, so the final series considered was 


aa [1-2 cos (21/p) B + Ben z 


iWemnesmituthat no Local differencing was necessary (that 
mo LOr oo d=0), and that the best results arose from the 
epplication of the non-general operator which took out a 
Single sine pave. is precisely in agreement with the hypo- 
thesis concerning the organized motion considered in this 
paper. The fact that the frequency of the organized motion 
Mierte alr is mot the same as the dominant frequency of the 
waves is not in good agreement with the hypothesis considered, 
though it is in agreement with Davidson's spectral results 
PwarEdsommeleOlee the Lack of efficiency in the ve Cp emda ier 
Lore rodletvonmome stationarity was judged to be due to 
spurious periodicities in the random turbulent components of 


Ene. serves. 
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Dees kiA MODEL FITTING 

Once a satisfactorily stationary time series was produced, 
a closer examination of the estimated autocorrelation and es- 
momated partial autocorrelation functions reyealed that these 
functions showed unmistakable evidence that a mixed model, 
that is, one requiring both autoregressive and moving average 
terms, was required for each series. However, the three ser- 
jes each showed different structure, and were considered 
separately. 

1. Wave Model 

The estimated autocorrelation function of the waves 
was most easily diagnosed for indications of the proper model. 
Mitemmatitern exhibited is that of an ARMA (p,q) x (P,Q) where 
p=—O0, q=1, P=1, Q=1. The pattern was quite distinct, and over- 
fmepemine ee by betting p-l, g=2, P=2, q=2, resulted in estimates 
por che Py Pos 4» and ©, parameters which were close to 
zero, while no significant decrease in the sums of the Squares 
of the residuals was achieved. Fitting a smaller model, one 
with fewer parameters, resulted in a significant increase in 
the sum of the squares of the residuals. Hence, an ARMA (0,1) 
x (1,1) was considered best, and the final fitted model was 
30 


Chee (1-0, B) (1-@, B>”)a, 


Rie geek? esr Ill, @, =70.53, and ©, = -0.33. 


Jae U-Component Model 


tiemesehiumed amb EOocorrelation function of the u-com- 


ponent series did not immediately correspond to a particular 
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model in the way that the waves’ function did. No particular 
PociwmilOnnwassGCasonal component to the model, other than differ- 
encing, was suggested. Hence, P and Q were initially taken as 
zero, and subsequent fitting of ARMA (p,q) models showed that 
the best model of this type was the ARMA (1,1). The final 
fitted model of this type was, after overfitting in both local 
and seasonal parameters, 


(1-9, B)w, = (1-6) B)a, 


where 4 = -0.31, and OL = ote Va 

It was interesting to note that the estimated partial 
autocorrelation showed large values at periodic intervals. 
There seemed to be two periods involved, one of 24 and the 
other of 31. This is a possible indication that a seasonal 
autoregressive operator might be useful. Two such models 
were tried, one with s=24 and one with Saal: 

Sl 


(1-9, 8) (1-2, 8B we 


(1-0, B)a, 


and 


(1~$,B) (1-9, B°*)w, 


(1-6, B)a, 


Each of these models produced improvements in the diagnostic 
checks of the a's, but the improvement was slight enough to 
make it questionable if these are parsimonious operators. 
oe Conmomenlte Mode d 
tiem jet tec amateocorrelation £Lunction and the esti- 
Mated partial autocorrelation function of the w-component 
series showed no indication that seasonal parameters other 


than differencing were required. After overfitting in both 
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local and seasonal parameters, the ARMA (1,1) x (0,0) was 
judged best. The final model was 


(1-$,B)w, = (1-0, 8B) a 


where 4 = -0.09 and ony = -0.87. 


Cc. SUMMARY 

All three time series were most adequately reduced to 
feaeaonarity by application of a difference operator which 
removed a Hiswils sine wave, with adaptive amplitude and phase 
eoelbe, from the data. The period of the motion in the air 
thus removed differed from that of the waves, the period in 
the air being 4.7 seconds, as opposed to the 6.0 second 
dominant period in the waves. 

Once reduced to stationarity, the wave series was best 
Pepresented by an ARMA (0,1) x (1,1) model The stationary 
Series from the u-component was adequately represented by 
Serer sam eanMaA (151) x (1,0) or an ARMA (1,1) model. The w- 
component was best represented by an ARMA (1,1) model. 

Piemdatampraved tO be of the type for which parametric 
bime series analysis is useful. The results show good agree- 
Pei eimbey Lous results obtained by spectral analysis of 


the same data by Davidson [1970]. 
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Y. CONCLUSIONS 


iiesmecessity Lor removang a single sine waye from the 
PtieaeeO achieve Stability strongly suggests that there was a 
Significant sinusoidal component of organized motion in the 
air flow immediately over the waves during the period in which 
the data were gathered. That this sinusoidal motion has a 
dominant period which is different than that of the waves is 
MiesuippOrtive Of the theories which predict that the air 
[eee tavye Orpanized motion with the same period as the waves. 
Mowevenb sy li 2S noted that the periods in question differ only 
twas seconds, although this appears significant in this 
densely sampled time series, which had about 25 observations 
per period (observations every .2 seconds). 

The fact that none of the series reduced to a random walk 


model, that is, 


after being differenced to stationarity, suggests that either 
Pee peOoneanmmzed MOotLom exists, or that some sort of struc- 
ture is imposed upon the turbulence. Due to the fact that 
Weyer ouwecusoemites OL cach type was considered, the significance 
CepelcwWiemodetbo whieh were fitted to the properly differ- 
enced series is unknown. Too few series were considered to 
be able to generalize about the structure of the turbulent 
flow from the ARMA models derived here. The possibility 


omen totoetiatethe repeated fitting of such models may 
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result sin a formulation of a2 general model by an adaptive 
Sioa te 2Y 

The most important conclusion of this exploratory work 
is that parametric time series analysis may be applied to 
wWind-wave data with a reasonable expectation that results of 
value can be obtained. [ft is noted in this respect that 
this analysis closely agreed with the traditional spectral 
approach in their region of overlap. The results in this 
Study show that the examination of the estimated autocorrela- 
tion and partial autocorrelation functions of various degrees 
of differencing of the original series may be of value in 
understanding the structure of such series, even if the actual 
meet ines Of the ARMA model to the resulting series is not 
attempted. In short, the parametric approach appears to hold 
considerable potential in the analysis of organized motion 


Within turbulent boundary layer data. 
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Per eNDhR A 


ModelenaentLitcation Considerations 


Micwtiieialbedaentiii@eation of the proper form of the ARIMA 
to best fit the data is based on examination of the estimated 
memocorrelaAtion function and the estimated partial autocorre- 
Mmerroue function of the Series. There are two distinct steps 
mmolved: identifying the degree of differencing necessary and 
identifying the resultant ARMA model. These steps are des- 
cribed briefly below; they are more fully developed in Box 
emde Jenkins (1970, pp. 174-207; 300-334]. 

iG@ueaitnstmsaeepmin tnataal model identification is deter- 
Mire the degree of differencing necessary to produce a 
Stationary time series from the raw data. Thus one is attempt- 
ing to estimate the proper Bite’: of D and d in the vove 
Operator. The distinctive characteristic of a non-Stationary 
Eime series for identification purposes is the fact that the 
“MeocOrbe lation rinetien does not go quickly to zero with 
increasing lag number. The autocorrelations may decrease 
with lag number, but the decrease will be closer to linear 
than the exponential damping expected with a stationary time 
Dae omer UL tier erecUumnent patterns spaced a fixed distance 
in time (at a fixed number of lags apart) may indicate the 
To (mmomescaconalediliferencimg, if the autocorrelation co- 
Tiwiometteomoonnot fall ofr scuificiently fast. Hence, one 
examines first the estimated autocorrelations of the original 


data, and then successive degrees of differencing, both 


DZ 





Seeeonaleand docal,; until a pattern consistent with a sta- 
tionary series is found. 

Mmemcecand steppers to examine the autacorrelations aris- 
Maeemrrom the properly differenced series for indications of 
mitemresulting ARMA model to fit to the differenced series. 
One is aided in this by the knowledge of the behavior of the 
mmeoecorrelation and partial autocorrelation functions for 
Memtous models, Ihe theoretical autocorrelation function of 
an AR(p) model tails off and the partial autocorrelation 
mimebion abruptly goes to zero after p lags. The theoretical 
miEocorrelation function of a MA(€q) process goes abruptly to 
mweoeatter gq tags, while the partial autocorrelation function 
meets Off. Both functions tail off in a mixed model. The 
estimated functions will generally follow the theoretical 
functions. However, the estimates have large variances, and 
Biomeuiatecwenmecdsmot be close. in general, it is possible to 
meoaguce AN autoOcOVariance generating function for any given 
fewel, this function can be used to show the form of the 
mieGmortealeaurocorrelation for any model. One can then com- 
mawe the Estimated autocovariance (or autocorrelation) func- 
Bion, to the theoretical possibilities to find a reasonable 
match. 

PiesuMiae yee ncmcokimated atitocorrelation and partial 
Zine derOoneivwmer bons ane examined for characteristic 
aso ene nNdtecate athe proper degree of differencing 
and the proper ARMA model (to be fit to the properly differ- 


enced Series). Due to the variability of such estimated 


ois 


Miterionmc, precise definition Of model is often impossible, 
PMmeclemat ii beandeexpertence Of the analyst may be signifi- 


Gant in Obtaining the best results. 
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APPENDIX B 


Diagnostic Checks of Model Adequacy 


In order to insure that the model which has been fit is 
adequate, it is useful to have available diagnostic procedures 
for testing the adequacy of a given model. Such tests may be 
mased on both evaluations at the total model level and at the 
level of the residuals alone. ewe vas diagnostic tests which 
were used eee eee below; these tests, and others, are 
meres tully described in Box and Jenkins [1970, pp. 287 ff]. 

PGimeanyvedatachostmescieck son the total model level is 
the comparison of the magnitudes of the sums of the squares 
Mme restduals between models. A significant decrease in 
this value with a Sere immoceleindicates that’ the original 
model was not adequate. A Second check on the whole model 
level is overfitting of parameters. If one judges, for exan- 
peece that an ARIMAS(p,d,q) is adequate while noting that if 
it were inadequate, it would likely be inadequate in the AR 
Parameter, then one might fit an ARIMA (p+l1,d,q) model. One 
Pould then compare results of original model and the over- 
Preeted model for tndications of the original model's adequacy. 
Pictemsc, tt ne SlLeniiilcane change in fit occurred, as would 
be the case if the best estimates of the extra parameters in 
the overfit model were all close to zero, then one is re- 
assured as to the adequacy of the model. 

To overcome the need for the prior knowledge of the sus- 


Meeecdmdettco1encies in the model, such prior knowledge being 
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implied in overfitting, it is possible to use diagnostic 
checks on the residuals generated by the model. The auto- 
Momeat ton function, Or equivalently the spectrum, of the 
residuals can be examined for indications that the residuals 
generated do not approximate white noise. Individual auto- 
correlations may be tested for significant deviation from a 
value of zero by Student-t tests, using the appropriate limits 
for estimated residuals rather than those for white noise. 
Finally, the first K autocorrelations of the residuals may 
be looked at as a whole. The value Q such that 

K 


Q=N E x (a), 
m=1 


where ee GDepacmenemrccidual autocorrelation coefficient at 
fiend, and NN — number of observations less d, the differenc- 
ing parameter, is distributed as Vo (Ree eels Here, the 
P,q,P, and Q are as before, and M=l1 if the mean were removed 
from the initial series before the analysis, and M=0 otherwise. 
Hemee, the final test discussed here is an eer oonness Of Eve 
test. 
In summary, tests may be made either between models or 

oP nmemodelomtondetermine the adequacy of models. Both types 


Mapes ce wetreuucedetu Ene analysis presented in this paper. 
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